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ABSTRACT Chemical denaturant sensitivity of the dimeric main protease from severe acute respiratory syndrome (SARS) 
coronavirus to guanidinium chloride was examined in terms of fluorescence spectroscopy, circular dichroism, analytical ultra- 
centrifuge, and enzyme activity change. The dimeric enzyme dissociated at guanidinium chloride concentration of <0.4 M, at 
which the enzymatic activity loss showed close correlation with the subunit dissociation. Further increase in guanidinium chlo- 
ride induced a reversible biphasic unfolding of the enzyme. The unfolding of the C-terminal domain-truncated enzyme, on the 
other hand, followed a monophasic unfolding curve. Different mutants of the full-length protease (W31 and W207/W218), with 
tryptophanyl residue(s) mutated to phenylalanine at the C-terminal or N-terminal domain, respectively, were constructed. Un- 
folding curves of these mutants were monophasic but corresponded to the first and second phases of the protease, re- 
spectively. The unfolding intermediate of the protease thus represented a folded C-terminal domain but an unfolded N-terminal 
domain, which is enzymatically inactive due to loss of regulatory properties. The various enzyme forms were characterized 
in terms of hydrophobicity and size-and-shape distributions. We provide direct evidence for the functional role of C-terminal 


domain in stabilization of the catalytic N-terminal domain of SARS coronavirus main protease. 


INTRODUCTION 


Severe acute respiratory syndrome (SARS) was one of the 
most serious epidemic diseases in 2003 and is still a threat to 
the public health worldwide (1). A specific SARS corona- 
virus (CoV) was identified to be the contagious agent. Mat- 
uration of the SARS-CoV involves proteolytic processing of 
the virus polyproteins by the chymotrypsin-like main pro- 
tease (3CLpro) and a papain-like protease (PLpro). SARS- 
CoV might invade cells by a protease-mediated endosomal 
pathway (2). Any knowledge about the structural stability of 
viral protease can be very rewarding for the development of 
antivirus drugs (3-6). 

The 3CLpro from SARS-CoV was the first protein of this 
virus to have its three-dimensional structure solved (7,8). As 
had previously been shown for the 3CLpro of transmissible 
gastroenteritis (corona)virus (9), the SARS-CoV enzyme 
also exists as a dimer in solution and in crystals, with each 
subunit containing three structural domains (Fig. 1). The 
catalytic dyad His*!-Cys'° is located between the interfaces 
of domains I (residues 8-99) and domain II (residues 100— 
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183), which constitute the chymotrypsin-like folding scaf- 
fold. This catalytic N-terminal domain is connected by a long 
loop (residues 184-199) to the helical domain III (residues 
200-306). The functional role of the extra C-terminal do- 
main III is less clear. Numerous experimental evidence 
shows the importance of N- and C-terminal residues in the 
dimerization (4,10-12). Structural studies indicate the close 
proximity of C-terminus and N-finger (N-terminal residues 
1-7) (9) from subunit A to the active site of subunit B that led 
to the development of a novel autocleavage mechanism that 
can explain the dimeric nature of the mature enzyme (13). 

Here we demonstrate that the SARS main protease 
provides us a very useful model for the folding studies of 
multisubunit, multidomain proteins. Novel folding interme- 
diates were identified, which allow us to delineate unequiv- 
ocally the functional role of domain III. 


MATERIALS AND METHODS 


Construction of 3CLpro expression vectors 


The genes of the full-length 3CLpro and C-terminal truncated 3CL,,7; of the 
SARS-CoV main protease were amplified by polymerase chain reaction 
(PCR) with appropriate primers. The forward primers for full length was 
5'-GGTGGTCATATGAGTGGTTTTAGG-3’, and the reverse primer was 
5'-AACTCGAGGGTAACACCAGAG-3’. After digestion with Bg/II and 
Xhol, the PCR product was divided into two fragments, 168-bp and 747-bp. 
The 168-bp fragment was then digested with NdeI. Finally, the 168-bp NdeI- 
BglII and 747-bp Beglll-Xhol fragments were coligated to the NdeI and Xhol 
sites of the vector pET-29a(+) (Novagen, Madison, WI). The 3CLy+1 was 
subsequently amplified by PCR, using the forward primer of 5’-TGAAG- 
ATCTGCTCATTCGCAA-3’, and reverse primer of 5'-AACTCGACTAT- 
GGTTGTGTCTG-3’. After digestion with Bg/II and Xhol, the PCR products 
were inserted into the Bg/II and Xhol sites of the pET-SARS main protease. 
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FIGURE 1 Structural features of SARS-CoV main protease. (A) The three 
structural domains of SARS-CoV main protease are shown in rainbow color 
(Protein Data Bank code: 1Z11). The catalytic dyad His*!-Cys'*° between 
domains I and II are highlighted with red side chain. The important residues 
Tyr'®! and His'® in the substrate-binding pocket are shown in blue side 
chain. The tryptophanyl residues Trp*!, Trp”°’, and Trp*!® are shown in 
green side chain. Other two residues Ser® and Asn* located at surface are 
also shown. (B) Surface model showing the unsymmetrical dimer associ- 
ation mode. N-terminal domain and C-terminal domain of subunit A are in 
blue and light blue, respectively. Generate with Pymol (37). 


Site-directed mutagenesis (14) was used to construct pET-H41A, pET-S65A, 
pET-N84A, pET-C145A, pET-Y161A, and pET-H163A vectors. The for- 
ward primers for H41A, S65A, N84A, C145A, Y161A, and H163A were 
5'-CTGTCCAAGAGCGGTCATTTGCAC-3’, 5’-CAAATCCAACCAT- 
GCGTTTCTTGTTC-3’, 5'-CATTCTATGCAAGCGTGTCTGCTTAG-3’, 
5'-CTTAATGGATCAGCGGGTAGTGTTG-3’, 5'-GTCTTTCTGCGCG- 
ATGCATCATATG-3’, and 5’-CTGCTATATGGCGCATATGGAGC-3’, 
respectively; and the reverse primers were 5’-GTGCAAATGACCGC- 
TCTTGGACAG-3’, 5'-GAACAAGAAACGCATGGTTGGATTTG-3’, 
5'-CTAAGCAGACACGCTTGCATAGAATG-3’, 5'-CAACACTACCCG- 
CTGATCCATTAAG-3’, 5'-CATATGATGCATCGCGCAGAAAGAC-3’, 
and 5'-GCTCCATATGCGCCATATAGCAG-3’, respectively. The pET-3CL 
expression vectors were used as templates and the primers as described 
above were used to mutate the His", Ser®, Asn**, Cys!*, Tyr’, and 
His!® codons to alanine codon by PCR. The mutational sites are shown 


underlined and in bold. 
For the tryptophan mutants of 3CLpro, the forward primers for W31F, 
W207F, and W218F were 5'’-CTCTTAATGGATTGTTCTTGGATGACA- 
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CAGT-3',  5’-CCATAACATTAAATGTTTTGGCATTCCTGTATGCT- 
GC-3’, and 5'-CAATGGTGATAGGTTCTTTCTTAATAGATTCACC-3', 
respectively. The reverse primers were 5'-ACTGTGTCATCCAAGAA- 
CAATCCATTAAGAG-3’, 5’-GCAGCATACAGGAATGCCAAAACA- 
TITAATGTTATGG-3’, and 5'-GGTGAATCTATTAAAGAAGAACC- 
TATCACCATTG-3’, respectively. For W207F/W218F (W31) double 
mutant, W207F was used as the template. 

All nucleotide sequences were confirmed by autosequencing analysis. A 
sequence comparison bearing the mutation positions at the active site region 
is shown in supplemental Fig. 1S (Supplementary Material). The wild-type 
(WT) enzyme and all mutants were successfully expressed and purified to 
apparent homogeneity by protocols described previously (11). Typical CD 
spectra of the recombinant 3CLpro are shown in Supplementary Material 
(Fig. 2S). 


Enzyme activity assay of SARS-CoV main 
protease using fluorogenic substrate 


The enzyme kinetic measurements were performed in 30 mM Tris-Cl buffer 
(pH 7.7) or phosphate-buffered saline (pH 7.6) at 30°C. Enhanced fluo- 
rescence due to cleavage of the internal quenched fluorogenic substrate 
peptide (ortho-aminobenzoic acid-TSAVLQSGFRK-2,4-dinitropheny! am- 
ide) was monitored at 420 nm with excitation at 362 nm using a Perkin- 
Elmer LS-50B luminescence spectrometer. 

The kinetic parameters were obtained by assaying the enzyme activity at 
various peptide substrate concentrations around its Michaelis constant (Kin) 
(11). The initial velocities (v,) were measured and fitted to the Michaelis- 
Menten equation (Eq. 1): 


KealE} |S] 


Mo IS] (1) 


where kat is the catalytic constant; [E],, is the enzyme concentration, and [S], 
is the substrate concentration. 


Reversible unfolding/refolding of the enzyme 
in GdnCl 


WT (3CLpro) or domain III-deleted (3CL;47,) SARS-CoV main protease was 
unfolded with different concentrations of GdnCl in Tris-Cl buffer (30 mM, pH 
7.7) at 30°C for 10 min. The unfolding of the enzyme was monitored by 
fluorescence, CD, sedimentation velocity, and enzyme activity loss. The 
unfolding was found to reach equilibrium in transient. A 10-min incubation, 
both unfolding and refolding, was used in the experiments in this study. 


Spectrofluorimetric analysis 


Fluorescence spectra of the enzyme were monitored in a Perkin-Elmer LS- 
50B luminescence spectrometer at 30°C, and all spectra were corrected for 
buffer absorption. The excitation wavelength was set at 280 nm (295 nm for 
tryptophan mutants, W31 and W207/W218), and the fluorescence emission 
spectrum was scanned from 300 to 400 nm. The maximal peak of the 
fluorescence spectrum and the change in fluorescence intensity were used in 
monitoring the unfolding processes of the enzyme. Both the red shift and the 
changes in fluorescence intensity were analyzed together using the average 
emission wavelength (AEW) ({A)) according to Eq. 2 (15): 


(F, X Xi) 
Qy= (2) 


N 
YF, 


i=A| 


in which F; is the fluorescence intensity at the specific emission wavelength (A). 
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Exposure of the enzyme hydrophobic areas was examined by mixing the 
enzyme solution with 1-anilino-8-naphthalene sulfonic acid (ANS) and the 
fluorescence emission spectrum between 400 and 600 nm was recorded with 
excitation at 395 nm. 


Circular dichroism analysis 


CD spectra were measured at 30°C with a Jasco J-810 spectropolarimeter 

under constant N> flush. The enzyme solution was scanned from 300 to 190 

nm. When the analysis was completed, the buffer was immediately scanned 

to obtain the baseline. Unfolded enzyme solution was filtered through a 

0.22-mm filter (Whatman, UK) before analysis. Denaturing agent included 

in the buffer was used to monitor the baseline for the unfolded enzyme. 
Mean residue ellipticity [0] was calculated according to Eq. 3: 


Mx® 


ll = pox e xP 


(3) 


where / represents the light path length (dm); M, the average molecular 
weight of amino acids; C, the concentration of protein (mg/ml); and @, the 
observed ellipticity. 


Analytical ultracentrifugation 


Sedimentation velocity was performed in a Beckman-Coulter XL-A analy- 
tical ultracentrifuge with an AnSOTi rotor at 20°C and 42,000 rpm in 12-mm 
double-sector Epon charcoal-filled centerpieces. The ultraviolet (UV) ab- 
sorption of the cells at 280 nm was scanned in a continuous mode with time 
interval of 8 min and a step size of 0.003 cm. The partial specific volume 
of the enzyme, solvent density, and viscosity were calculated by the free 
software SEDNTERP (http://www.jphilo.mailway.com/). All samples were 
visually checked for clarity after ultracentrifugation to make sure that there 
was no indication of precipitation due to unfolding of the protein. Multiple 
scans at different time points were analyzed with the SEDFIT program 
(16,17). 

The size-and-shape distributions of the enzyme in the presence of GdnCl 
were analyzed with a new feature of SEDFIT (17) that allowed full ex- 
ploration of the protein conformational changes. The sedimentation velocity 
data were fitted using a two-dimensional distribution with respect to fric- 
tional ratio c(s, f/f,) according to Lamm equation (17,18) (Eq. 4): 


a(r, 1) = / / els.f/Alx(s:D(s.Flf)sritdsd f/f), (4) 


with a(r,f) denoting the observed optical signal at radius r and time f 
x(s,D,r,t), the solution of the Lamm equation; and D(s, f/f,), the dependence 
of diffusion coefficient (D) on sedimentation coefficient (s) and frictional 
ratio (f/f), where 
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with w denoting angular velocity; k, Boltzmann constant; T, absolute tem- 


D(s,f/fo) 


perature; », enzyme partial specific volume; 7, buffer viscosity; and p, buffer 
density. 

All two-dimensional distributions were solved and normalized to a con- 
fidence level of p = 0.95 by maximum entropy and a resolution N of 200 
with sedimentation coefficients between 0.1 and 20 S. The anhydrous fric- 
tion ratio is from 1.0 to 2.0 or 3.5 at a resolution of 10. 
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Unfolding data analysis 


Because unfolding/refolding is a reversible process (see Fig. 2), the un- 
folding data were treated with the following thermodynamic models by 
global fitting of the data to Eq. 7 or Eq. 8: 

The two-state unfolding model (Scheme I) was described by Eq. 7 (19): 
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The three-state unfolding model (Scheme II) was described by Eq. 8 (20): 
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FIGURE 2 Equilibrium unfolding/refolding of the SARS-CoV main 
protease in GdnCl. The enzyme (5 pg/ml for 3CLpro and 15 yg/ml for 
3CLi+n) in 30 mM Tris-Cl buffer (pH 7.7) was excited with 280 nm UV 
light and the protein intrinsic fluorescence emission spectra were monitored 
at 30°C. (A) 3CLpro; (B) 3CLy}+1. Curves are: (a) native enzyme in Tris-Cl 
buffer; (b) native enzyme in the same buffer containing 0.6 M GdnCl; (c) 
unfolded enzyme in 6 M GdnCl-buffer; (d) refolded enzyme in 0.6 M 
GdnCl-buffer. In curve d, the protein was unfolded in 6 M GdnCl-buffer for 
10 min and then diluted 10-fold with the buffer. All samples were treated 
with GdnCl-buffer or buffer-only simultaneously for the same duration. The 
final enzyme concentrations were the same for all spectra. 


SARS-CoV Main Protease 


7 (seen ee) 
yn tye 


t+yy-e. 


1377 


AG (4,0) ™N—1IGancl 46 (4,0), y I ULGancl 
=e RP ae sy 
-e 


Yoos = 


e 
~ eee, 
l+e 


where yop; is the observed biophysical signal; yx, yz, and yy are the calculated 
signals of the native, intermediate, and unfolded states, respectively. [GdnCl] 
is the GdnCl concentration, and AGy.; and AG;_y are the free energy changes 
for the NI and IU processes, respectively; my.; and my are the sen- 
sitivities of the respective unfolding process to denaturant concentration. 


RESULTS 
Active site region of SARS-CoV main protease 


The original crystal structure of SARS-CoV main protease 
revealed that the dimeric enzyme is half-sited at pH 6.0, in 
which one of the protomer appears to be active and the other 
inactive (7). In a recent article, Chen et al. (21) confirmed 
that the dimeric SARS-CoV main protease is asymmetric 
and only one protomer is active at a time. They proposed a 
novel association-activation-catalysis-dissociation mecha- 
nism for the enzyme activity control. We checked whether 
dissociation is the major reason for enzyme inactivation. The 
following residues were selected for analysis: His*! and 
Cys'*°, the catalytic dyad; His'®’, the substrate-binding resi- 
due; Tyr’®', residue for holding His'™ in place. Some sur- 
face residues at domain I, Ser® and Asn**, were also selected 
as control. In the Ramachandran plot (22), Asn** is in the 
disallowed region of the unbound 3CLpro but in a gener- 
ously allowed region of a 3CLpro-azapeptide inhibitor com- 
plex (23). Table 1 summarizes the kinetic parameters of these 
mutants. Mutation at catalytic dyad residues (H41A and 


TABLE 1 Kinetic parameters of the SARS-CoV main 
protease mutants 
Quaternary Km Keat Keatl/Km 
Protease structure (uM) (10° s~!) (mM! s~') 
Experiment tr 
3CLpro Dimer 105+ 3.1 85.5 + 7.1 8.1 + 2.5 
H41A* Dimer - 0.0002 = 
C145A* Dimer - 0.0002 - 
H163A Dimer 175 +42 014+ 0.01 0.008 + 0.002 
Y161A* Monomer/ = 0.006 - 
polymer 
S65A Dimer 320+9.9 96.2 + 143 3.0 = 1.0 
N84A Dimer 9.9+2.3 64.1 + 115 6.5 4 1.9 
3CLivn Monomer Npbt Npbt Npbt 
Experiment II® 
3CLpro Dimer 145445 52+ 0.7 0.4 + 0.1 
W31 Dimer 18.7+78 59+ 1.1 0.3 + 0.1 
W207/W218 Dimer 145+43 110+ 13 0.8 + 0.3 


*The enzyme activities for these mutants were too small to get reliable 
kinetic parameters, only relative activity was reported. 

tNot detectable. 

*Determined in phosphate-buffered saline (pH 7.6). 

SDetermined in 30 mM Tris-Cl buffer (pH 7.7). 
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C145A) abolishes almost the complete biological function 
while the enzyme exists exclusively as dimer. Assuming that 
the rate-limiting step of the hydrolytic activity of 3CLpro, 
like chymotrypsin, is at the covalent acylation-deacylation 
step, then the K,, value for the substrate reflects an apparent 
binding affinity for the enzyme-substrate complex. However, 
mutation at S1 site (H163A) of the substrate-binding pocket 
did not change the substrate binding appreciably as antic- 
ipated. The major effect was on the k.., value. H163A pos- 
sessed ~0.1% of enzyme activity compared with WT. In 
contrast, the catalytic dyad mutants, H41A and C145A, had 
only 10~° relative activity left. 

An interesting finding was the Y161A mutant, which has 
only 7 X 10~° relative enzymatic activity left. Y161A dis- 
sociated to an unstable monomer that is prone to polymerize 
(Supplementary Material, Fig. 3S). 


Reversible unfolding of SARS-CoV main protease 
in GdnCl 


SARS-CoV 3CLpro was denatured at high temperature. The 
thermal denaturation was irreversible. White coagulations 
occurred in the cuvettes when the enzyme at 0.3-0.4 mg/ml 
in phosphate-buffered saline (pH 7.6) was heated to 60°C for 
30 min (data not shown). This result was also observed in 
other studies (24,25). Thermal unfolding is thus not suitable 
for further analysis of the unfolding process of 3CLpro. 

The fluorescence emission spectrum of the native enzyme 
shows a maximum at 328 nm (Fig. 2 A, curve a). The enzyme 
was very sensitive to GdnCl. In the presence of 6 M of GdnCl, 
the enzyme was completely unfolded in 10 min and the 
maximum emission fluorescence was shifted to 358 nm (Fig. 
2 A, curve c). The unfolding was found to be completely 
reversible. A simple 10-fold dilution instantaneously induced 
refolding of the enzyme to a yield of 92-95% as judged by the 
fluorescence spectrum recovery (Fig. 2 A, curve d). 

Similar results were obtained for the C-terminal domain- 
I-truncated enzyme (3CL;+1) in which only one buried 
tryptophanyl residue remained and thus very little fluores- 
cence was detected in the folded state (Fig. 2 B, curve a). The 
GdnCl-induced unfolding is thus very suitable for confor- 
mational stability studies of the enzyme. 


Unfolding curves of SARS-CoV main protease in 
various concentrations of GdnCl 


Unfolding of the enzyme in different concentrations of 
GdnCl is shown in Fig. 3. When the integrated fluorescence 
area was calculated at various GdnCl concentrations, a 
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monophasic unfolding process seems sufficient for describ- 
ing the data (Fig. 3 C). However, there is a strong denaturant- 
dependent pretransition slope, which might indicate exposure 
of hydrophobic areas (26). Similar results were obtained with 
CD spectra (Fig. 3 D). More interesting results are shown in 
Fig. 3 B, which shows clearly a biphasic red shifting of the 
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FIGURE 3 Unfolding curves of the full-length (3CLpro) SARS-CoV main 
protease in different concentrations of GdnCl. The enzyme, 3CLpro, 5 wg/ml 
(A-C) and 0.35 mg/ml (D) in 30 mM Tris-Cl (pH 7.7) was mixed with various 
concentrations of GdnCl. The fluorescence (A—C) or CD spectrum (D) was 
recorded after 10 min incubation at 30°C. (A) Average fluorescence emission 
wavelength (AEW) (Eq. 2). (B) Fluorescence emission wavelength shift. (C) 
Integrated fluorescence area. (D) Circular dichroism changes at 222 nm versus 
GdnCl concentration. Solid symbols are experimental data and the smooth 
curves are computer-generated best-fitted lines according to a three-state 
unfolding model (A and B) (Eq. 8) or a two-state unfolding model (C and D) 
(Eq. 7). The fitting residuals are also shown under each panel (open circles). 
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fluorescence emission peak from 328 to 358 nm upon ad- 
dition of GdnCl. The average emission wavelength (Fig. 3 A) 
that registered changes in both fluorescence wavelength and 
fluorescence intensity was used to calculate the thermody- 
namic parameters of the unfolding/refolding process (Table 2). 

The folded enzyme started to unfold beyond ~0.4 M GdnCl 
and reached an unfolding intermediate with [GdnCl]o.5.n-1 at 
1.1 M. The second unfolding phase started at ~3 M denaturant 
with [GdnCllo.51u = 4.0 M, corresponding to free energy 
change (AG) of 1.22 + 0.16 kcal/mol, and 10.88 + 0.27 kcal/ 
mol, respectively, for the N < J and/ « U processes. 


Quaternary structural changes of SARS-CoV 
main protease in the presence of GdnCl 


The above results indicate clearly a biphasic unfolding curve 
for the dimeric enzyme. It is ultimately important to differ- 
entiate between dissociation versus unfolding under GdnCl. 
Analytical ultracentrifugation was used to examine this point. 
Fig. 4 A shows a typical sedimentation velocity profile for the 
enzyme in the presence of 0.4 M GdnCl. As shown in Fig. 4, B 
and C, the enzyme already dissociated to monomers at 0.4 M 
GdnCl. The major species of 4.6 = 0.2 S corresponds to a 
dimer in the absence of GdnCl (Fig. 4 C), whereas in 0.4 M 
GdnCl the major peak corresponds to a monomer with a value 
of 2.9 + 0.2 S. An equivalent continuous distribution C(s,M) 
plot (not shown) indicated the dissociation of dimeric 3CLpro 
with molecular mass of 61 + 17 kDa in buffer to a monomer 
with molecular mass of 33 + 6 kDa in 0.4 M GdnCl. The 
biphasic-unfolding phenomenon is thus not a dissociation- 
unfolding process. Fig. 3, A and B, then represents a folding/ 
unfolding process of the dissociated monomer. 


Correlation of quaternary structural changes of 
SARS-CoV main protease with enzymatic activity 


We then examined the enzymatic activity change upon 
GdnCl unfolding. The results shown in Fig. 5 indicate the 
close correlation of enzymatic activity loss with dimer disso- 
ciation. It should be noted that the protein concentrations 
were different in these experiments. The amount of enzyme 
used in activity assay (5 jg/ml) is much smaller than that 
used in AUC experiment (0.5—-0.85 mg/ml). However, this 
still suggests that dissociation induces enzyme inactivation. 


Possibility of unfolding intermediate as a molten 
globule state 


After dissociation, the conformational changes of 3CLpro 
were clearly demonstrated by the increase in frictional ratio 
of the enzyme upon addition of GdnCl (Fig. 5). Analytical 
ultracentrifugation provides very detailed conformational 
information about the enzyme structural changes. Significant 
unfolded intermediate (15%) with frictional ratio of 2.67 was 
detected at 2 M GdnCl. This unfolded intermediate form was 
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TABLE 2 Stability parameters of SARS-CoV full-length and C-terminal truncated main proteases 


Protease AGy. (kcal mol!) my, (kcal mol~! M7!) 
3CLpro* 1.22 + 0.16 1.15 + 0.11 
wait 1.39 + 0.46 0.91 + 0.22 
w207/W218" - - = 
3CLivn * 7.27 + 0.35 4.94 + 0.23 


[GdnCl]o.s,.n-1 (M) 


1.1 + 0.2 
1.5 + 0.6 


15 + 0.1 


AG, (keal mol™') my (Keal mol~! M7!) [GdnCl]o.5,1.5 (M) 


10.88 + 0.27 2.70 + 0.07 40+ 0.1 


10.25 + 0.47 2.49 + 0.12 4.1 + 0.3 


*The unfolding data were fitted to a three-state unfolding model according to Eq. 8. 
The unfolding data were fitted to a two-state unfolding model according to Eq. 7. 


not detected at 0.4 M or 4 M GdnCl, in which the enzyme 
existed exclusively as folded monomers or unfolded mon- 
omers, respectively. 

The above observed unfolding intermediate may represent 
a molten globule state with exposure of the interior hydro- 
phobic area, which will have pronounced binding ability to 
ANS and have a higher propensity to aggregate (27). How- 
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FIGURE 4 Quaternary structural changes of SARS-CoV main protease in 
GdnCl. Continuous sedimentation analysis of the SARS-CoV main protease 
in the presence and absence of GdnCl. (A) Sedimentation of the enzyme in 
30 mM Tris-Cl (pH 7.7) containing 0.4 M GdnCl. The sedimentation 
velocity data were analyzed with SEDFIT (17). The fitting residuals are 
shown under the panel. (B) C(s, f/f.) distribution of the enzyme in 0.4 M 
GdnCl. (C) C(s, f/fo) distribution of the enzyme in the absence of GdnCl. 


ever, no precipitation was observed for 3CLpro at any of the 
GdnCl concentrations. Neither was polymerization detected 
by AUC. ANS binding ability of the GdnCl-unfolded en- 
zyme remained unchanged (Supplementary Material, Fig. 4S). 
This probably excludes a gross conformational change of the 
enzyme and the major intermediate form detected at 14 M 
GdnCl may not represent a partially unfolded molten globule 
state. The remaining possibility is then differential domain 
stability. The unfolding of 3CL;+1, was then examined. 


Unfolding curves of the C-terminal domain-truncated 
SARS-CoV main protease (3CL),1) 


In the presence of GdnCl, 3CL;+1, follows the simple 
monophasic-unfolding curve (Fig. 6), which is also com- 
pletely reversible (Fig. 2 B). Compared with the unfolding 
curve of the full-length 3CLpro, it is very likely that the 
biphasic unfolding of 3CLpro represents differential confor- 
mational stability of domain (I + ID) and domain III. 

This helps us to differentiate domain stability of 3CLpro 
and assign the functional role of domain III. However, it is 
still difficult to assign the corresponding domain to the 
biphasic unfolding. To provide a structural basis for the 
biphasic unfolding curves shown in Fig. 3, A and B, we 
constructed two mutants of 3CLpro, each with tryptophanyl 
residue(s) mutated at different domain and the fluorescence 
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FIGURE 5 GdnCl-induced dissociation and unfolding of SARS-CoV 
main protease. (Open circles) Remaining enzyme activity; (open triangles) 
remaining dimer percentage. Unfolding of the enzyme was demonstrated 
by the contour of continuous sedimentation coefficient and frictional ratio 
C(s, flf>) with the most concentrated region shown in red. The x axis of 
these contours has no physical meaning. 
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FIGURE 6 Unfolding curves of the C-terminal domain-truncated mutant 
(3CLy41) SARS-CoV main protease in different concentrations of GdnCl. 
The 3CL}+ 11 enzyme (13.3 yxg/ml in panels A-C, and 0.18 mg/ml in panel D) 
in 30 mM Tris-Cl (pH 7.7) was mixed with various concentrations of GdnCl. 
The fluorescence (A—C) or CD spectrum (D) was recorded after 10 min 
incubation at 30°C. (A) Average fluorescence emission wavelength (AEW) 
(Eq. 2). (B) Fluorescence emission wavelength shift. (C) Integrated fluo- 
rescence area. (D) Circular dichroism changes at 222 nm versus GdnCl 
concentration. Solid symbols are experimental data and the smooth curves are 
computer-generated best-fitted lines according to a two-state unfolding model 
(Eq. 7). The fitting residuals are shown under each panel (open circles). 


of these mutants now reports the structural changes of in- 
dividual domain. 


Unfolding curves of the tryptophanyl 
residue(s)-mutated SARS-CoV main protease 


3CLpro has a total of three tryptophany! residues (Trp*! at 
domain I, ‘Trp and Trp’'® at domain III, domain II has no 
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Trp) (Fig. 1 A). W31, a double mutant with both Trp*”” and 
Trp7!® of domain III mutated to phenylalanine and only one 
tryptophanyl residue (Trp*') remained. When excited with 
295 nm UV light, the protein intrinsic emission fluorescence 
came from the tryptophanyl residue only. Thus W31 will 
only report the structural information of the N-terminal 
domain. Similarly, W207/W218, a single W31F mutant, will 
register the conformational change of the C-terminal domain. 

Mutation at these tryptophanyl residues to phenylalanine 
did not change the kinetic properties of the enzyme. The kat 
and K,, values of W31 or W207/W218 were in the same 
order as in WT. The AUC data indicated that these Trp 
mutants still existed as dimer in solution (Table 1). When 
W31 and W207/W218 were unfolded in GdnCl, the fluore- 
scence signal showed monophasic in both cases but distinct 
unfolding curves of these mutants were obtained (Fig. 7). It 
is clear that the N-terminal domain of 3CLpro (W31) is 
unfolded ([GdnCl]o.5 ~ 1 M) before the C-terminal domain 
(W207/W218) ([GdnCl]o.5 ~ 4 M). The unfolding param- 
eters of 3CL;+_ thus represented the first phase of the 
3CLpro (Table 2). 
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FIGURE 7 Unfolding curves of the tryptophanyl residue(s)-mutated 
SARS-CoV main protease in different concentrations of GdnCl. The mutant 
3CLpro, W31 (40 wg/ml) (A) or W207/W218 (18 g/ml) (B), in 30 mM 
Tris-Cl (pH 7.7) was mixed with various concentrations of GdnCl. The 
enzyme solution was excited at 295 nm, and the emission fluorescence was 
recorded at 30°C. Average fluorescence emission wavelength (AEW) (Eq. 2) 
of each mutant was used to present the unfolding data. (A) W31 (or more 
precisely W207F/W218F) represents the structural change of N-terminal 
domain. (B) W207/W218 (or more precisely W31F) represents the structural 
change of C-terminal domain. Solid symbols are experimental data and the 
smooth curves are computer-generated best-fitted lines according to a two- 
state unfolding model (Eq. 7). The fitting residuals are shown under each 
panel (open circles). 
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DISCUSSION 


Some of the active site residues of SARS-CoV and related 
coronavirus main proteases have been explored previously. 
Thus Cys'*° and His'®* have been mutated in 3CLpro of 
transmissible gastroenteritis (corona) virus (9); His", Cys! ; 
Tyr’! in feline infectious peritonitis virus (28); and Cys'* 
in SARS-CoV (10,13,29). Table 1 summarizes the results 
obtained in this study. As expected, the almost complete loss 
of enzymatic activity of H41A and C145A confirmed the 
catalytic roles of the catalytic dyad (29,30). Similar obser- 
vations were made for His'® and Tyr’! Both of these 
residues are involved in substrate binding. The main function 
of Tyr'®! is to hold His'® in place. Mutation of His'® led to 
a relatively small increase in K,, but a large drop in Kea 
(Table 1). This could result from the enzyme activity assays 
being performed at pH 7.6, where one protomer of the 
dimeric enzyme adopts a different conformation from that in 
acidic environment (7). It has been postulated by Tan et al. 
(30) and also by Yang et al. (7) that at low pH, protonation of 
His'®? induces a series of conformational changes that lead 
to the collapse of the oxyanion hole and blocks the SI pocket 
entry. This would be in full agreement with the observation 
of a very large decrease in k.,, as shown in Table 1. An 
intriguing question will then be testing the pH-activity 
profile of H163A mutant to confirm that His'®? is responsible 
for the lower activity of 3CLpro at pH 6.0 as predicted by 
molecular dynamic simulation (30). 

The contribution of this study includes our presentation of a 
clear-cut dissociation-denaturation profile as well as a differ- 
ential domain unfolding process. Protein folding constitutes 
one of the most challenging problems in the life science area 
(31-35). The SARS-CoV main protease has a unique 
structure with distinct structural domains. Each subunit of 
the protein has 13 strands and 11 helices, distributed in 
different structural domains (7). A multiphasic unfolding 
process may be anticipated. From the present analyses, the 
differential stability of different structural domains of the 
enzyme was deduced in terms of their free energy changes 
during transformation from the folded state to the unfolded 
state with an intermediate state involvement of folded domain 
If but unfolded domain (I+ ID by the following reasoning. 

Bacha et al. (24) had demonstrated the intrinsic instability 
of the C-terminal domain II by differential scanning 
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calorimetry albeit this technique seemingly does not provide 
enough resolution to detect the unfolding intermediate. We 
observed a clear biphasic unfolding curve for the 3CLpro by 
the protein intrinsic fluorescence. The intermediate form was 
assigned to a folded C-terminal domain but an unfolded 
N-terminal domain on the basis of the following observa- 
tions: a), Our unfolding data of 3CLpro clearly demonstrate 
the biphasic-unfolding process and our data of 3CLy+1 
(N-terminal domain) suggest attributing the first and second 
phases of the 3CLpro to the N- and C-terminal domains, 
respectively (Table 2). b), Unequivocal assignment of the 
biphasic unfolding curve to differential domain stability 
came from results shown in Fig. 7. The W31 and W207/ 
W218 are full-length 3CLpro. The unfolding curves shown 
in Fig. 7 should be biphasic. However, because the protein 
was excited with 295 nm UV light, one of the domain 
unfoldings is invisible and the unfolding curve only regis- 
tered the structural changes of the tryptophan-containing 
domain. W31 and W207/W218 thus report conformational 
changes of the N-terminal and C-terminal domains, respec- 
tively. It is then clear that the N-terminal domain is labile 
than the C-terminal domain. The stabilization of the catalytic 
N-terminal domain by the C-terminal domain III is thus 
confirmed. The intrinsic conformational stability of domain 
(+I) without domain II has a much higher free energy 
(7.27 kcal mol‘) than full-length 3CLpro (1.22 kcal mol™ 4) 
(Table 2). The significant larger mganc; value of 3CLy+1 
(4.94 kcal mol”! M~') than full-length 3CLpro (1.15 kcal 
mol ' M~') clearly indicates the pronounced increase of 
surface area exposure on denaturation of 3CL;+1 (26). This 
actually defines the functional role of domain III in SARS- 
CoV main protease. 

After dissociation, the distribution of three structural forms 
of 3CLpro in various GdnCl concentrations is very clear. It 
includes a folded monomer, an intermediate with folded 
domain II but unfolded domain (I+ID, and a completely 
unfolded form. The enzyme starts to dissociate at very early 
stage, at GdnCl concentration of <0.1 M. The dissociated 
monomer then starts to unfold at <0.4 M and reaches the 
first plateau at 2-3 M. The second unfolding then starts, 
which reaches another plateau at 4.5 M GdnCl. The complete 
dissociation and differential unfolding of the structural do- 
mains are summarized in Fig. 8. This schematic presentation 


A. Dissociation HS: Ra al N-I Unfolding I-U Unfolding 
rox mm 5 % Fe > <_————_s: [POR _————; ( § BRS 
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Folded Dimer 


Unfolded Intermediate Unfolded Monomer 


FIGURE 8 A schematic model for the dissociation and unfolding of SARS-CoV main protease in GdnCl. The stepwise dissociation and unfolding of the dimeric 
3CLpro are shown schematically with N-terminal domain in red and C-terminal domain in blue. Unfolding portions of the polypeptide are shown in red thread. 
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only lists the major enzyme species of the complex folding 
process. The sharp unfolding curves for 3CL;+1 (Fig. 6) and 
the completely reversible of the unfolding process (Fig. 2 B) 
consistent with that 3CL,+;,, without domain III can fold 
independently into an intact chymotrypsin-like fold. Domain 
Il thus must have extra functional role. 

Shi and Song (36) have observed a nanochannel between 
domain III and the active center and proposed a regulatory 
effect of the extra domain III in relaying the catalytic ma- 
chinery. Chen et al. (21) suggest that the enzyme follows an 
association-activation-catalysis-dissociation catalytic cycle. 
This will provide further insight into the functional role of 
domain III in the dimerization and involvement in the en- 
zyme activity regulation beyond simple stabilization of the 
catalytic N-terminal domain. 

In summary, both 3CL +1; and 3CLpro show a completely 
reversible unfolding/refolding process in GdnCl. We dem- 
onstrate here a clear-cut case of the differential stability of 
quaternary and domain structures. The unfolding studies of 
SARS-CoV main protease may shed light on the inherent 
structural stability of this important viral enzyme. 


SUPPLEMENTARY MATERIAL 


An online supplement to this article can be found by visiting 
BJ Online at http://www.biophysj.org. 
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